assessment of internal validity of prognostic models through bootstrapping and multiple imputation of missing data

نویسندگان

mr baneshi

a talei

چکیده

background: prognostic models have clinical appeal to aid therapeutic decision making. two main practical challenges in development of such models are assessment of validity of models and imputation of missing data. in this study, importance of imputation of missing data and application of bootstrap technique in development, simplification, and assessment of internal validity of a prognostic model is highlighted. methods: overall, 310 breast cancer patients were recruited. missing data were imputed 10 times. then to deal with sensitivity of the model due to small changes in the data (internal validity), 100 bootstrap samples were drawn from each of 10 imputed data sets leading to 1000 samples. a cox regression model was fitted to each of 1000 samples. only variables retained in more than 50% of samples were used in development of final model. results: four variables retained significant in more than 50% (i.e. 500 samples) of bootstrap samples; tumour size (91%), tumour grade (64%), history of benign breast disease (77%), and age at diagnosis (59%). tumour size was the strongest predictor with inclusion frequency exceeding 90%. number of deliveries was correlated with age at diagnosis ( r =0.35, p <0.001). these two variables together retained significant in more than 90% of samples. conclusion: we addressed two important methodological issues using a cohort of breast cancer patients. the algorithm combines multiple imputation of missing data and bootstrapping and has the potential to be applied in all kind of regression modelling exercises so as to address internal validity of models.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Assessment of Internal Validity of Prognostic Models through Bootstrapping and Multiple Imputation of Missing Data

BACKGROUND Prognostic models have clinical appeal to aid therapeutic decision making. Two main practical challenges in development of such models are assessment of validity of models and imputation of missing data. In this study, importance of imputation of missing data and application of bootstrap technique in development, simplification, and assessment of internal validity of a prognostic mod...

متن کامل

Bootstrapping and Multiple Imputation Ensemble Approaches for Missing Data

Correspondence *Corresponding author Email: [email protected] Presence of missing values in a dataset can adversely affect the performance of a classifier; it deteriorates rapidly as missingness increases. Single and Multiple Imputation (MI) are normally performed to fill in the missing values. In this paper, we present several variants of combining MI and bootstrapping to create ensembl...

متن کامل

assessment of deep word knowledge in elementary and advanced iranian efl learners: a comparison of selective and productive wat tasks

testing plays a vital role in any language teaching program. it allows teachers and stakeholders, including program administrators, parents, admissions officers and prospective employers to be assured that the learners are progressing according to an accepted standard (douglas, 2010). the problems currently facing language testers have both practical and theoretical implications but the first i...

assessment of the efficiency of s.p.g.c refineries using network dea

data envelopment analysis (dea) is a powerful tool for measuring relative efficiency of organizational units referred to as decision making units (dmus). in most cases dmus have network structures with internal linking activities. traditional dea models, however, consider dmus as black boxes with no regard to their linking activities and therefore do not provide decision makers with the reasons...

Multiple Imputation for Missing Data

Multiple imputation provides a useful strategy for dealing with data sets with missing values. Instead of filling in a single value for each missing value, Rubin’s (1987) multiple imputation procedure replaces each missing value with a set of plausible values that represent the uncertainty about the right value to impute. These multiply imputed data sets are then analyzed by using standard proc...

متن کامل

منابع من

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید


عنوان ژورنال:
iranian journal of public health

جلد ۴۱، شماره ۵، صفحات ۱۱۰-۱۱۵

میزبانی شده توسط پلتفرم ابری doprax.com

copyright © 2015-2023